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Abstract 

Background: Drug susceptible clinical isolates of Candida albicans frequently become highly tolerant to drugs 
during chemotherapy, with dreadful consequences to patient health. We used RNA sequencing (RNA-seq) to 
analyze the transcriptomes of a CDR (Candida Drug Resistance) strain and its isogenic drug sensitive counterpart. 

Results: RNA-seq unveiled differential expression of 228 genes including a) genes previously identified as involved 
in CDR, b) genes not previously associated to the CDR phenotype, and c) novel transcripts whose function as a 
gene is uncharacterized. In particular, we show for the first time that CDR acquisition is correlated with an 
overexpression of the transcription factor encoding gene CZF1. CZF1 null mutants were susceptible to many drugs, 
independently of known multidrug resistance mechanisms. We show that CZF1 acts as a repressor of (3-glucan 
synthesis, thus negatively regulating cell wall integrity. Finally, our RNA-seq data allowed us to identify a new 
transcribed region, upstream of the TAC1 gene, which encodes the major CDR transcriptional regulator. 

Conclusion: Our results open new perspectives of the role of Czfl and of our understanding of the transcriptional 
and post-transcriptional mechanisms that lead to the acquisition of drug resistance in C. albicans, with potential for 
future improvements of therapeutic strategies. 
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Background 

The yeast Candida albicans is the major cause of oppor- 
tunistic fungal infections in humans. In case of systemic 
infections, the mortality rate can reach 50% [1]. Azoles, 
which target the fungal P450 cytochrome 14alpha- 
lanosterol demethylase encoded by the ERG 11 gene, are 
the most commonly used antifungal molecules for candi- 
dosis treatment [2]. Unfortunately azoles only have a fun- 
gistatic effect and therefore have allowed the emergence 
of multidrug resistance strains in patients [3]. There are 
two main groups of mutations which cause azole resist- 
ance in Candida albicans. The first group directly targets 
ERG 11, either in cis by creating ERG 11 alleles which en- 
code a protein variant insensitive to azoles [4], or in trans 
by increasing the expression of ERG 11 through gain of 
function mutations in the UPC2 gene, which encode a 
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transcription factor regulating ERG1 1 [5]. The second type 
acts by increasing the expression of membrane transpor- 
ters which export the drugs and therefore decrease their 
intracellular concentration [5]. Two types of drug resist- 
ance, called MDR and CDR respectively, are distinguished, 
depending on the type of transporters which is involved. 
In most of the clinical isolates studied so far, the cells 
present either a drug resistance of the MDR or the CDR 
type [5,6]. Only in rare occurrences, the two kind of resist- 
ance can be observed in the same cells [6-8]. The MDR 
resistant cells overexpress the Mdrl permease encoding 
gene, which is under the control of the Mrrl and Capl 
transcription factors [9]. MDR1 is homologous to the Sac- 
charomyces cerevisiae FLR1 transporter, which is involved 
in the detoxification of several drugs, including the anti- 
fungal benomyl [8,10,11]. The CDR resistant strains over- 
express two ABC transporters encoding genes, CDR1 and 
CDR2, which are homologous to the pleiotropic drug re- 
sistance transporter Pdr5 from S. cerevisiae [12,13]. These 
transporters are major determinants of the resistance to 
fluconazole, the most widely used azole. Their expression 
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is controlled mainly by the Tacl transcription factor [14]. 
Most of the CDR resistant strains present a loss of hetero- 
zygocity and/or aneuploidy at the TAC1 locus, combined 
to gain of function mutations of TAC1 [15,16]. Microarray 
and proteomic analyses identified about 30 genes which 
are co-overexpressed with CDR1 and CDR2 in all CDR 
clinical isolates examined [17,18]. Chromatine immuno- 
precipitation and taclA strain analyses identified the 
TAC1 regulon, which is composed of about ten genes 
mostly involved in membrane properties, including CDR1, 
CDR2, the phospholipid transferase gene PDR16, the pu- 
tative flippase gene RTA3 or the putative sphingosine kin- 
ase gene LCB4 [18,19]. 

The recently developed RNA sequencing (RNA seq) 
technology is revolutionizing our ability to analyse 
eukaryotic transcriptomes [20-22] and has given a chance 
to unravel the actual complexity of the CDR phenotype 
better. RNA seq has been proven to be more sensitive 
than microarrays [21]. RNA seq also results in more ac- 
curate measurements of gene expression changes [23]. 
Moreover, relative to standard microarray approaches, 
RNA seq and tiling arrays provide crude measurement 
and identification of transcripts, without any a priori on 
which region of the genome is transcribed [24,25] . Hence, 
two recent RNA seq and one tiling array analyses of the 
Candida albicans transcriptome identified more than 
1000 new transcripts, many of which are expressed in a 
condition-specific way [25-27]. Most of these transcripts 
do not have a coding potential and may be long non cod- 
ing regulatory RNAs. 

In this study, we have used RNA seq to analyse the 
transcriptomes of a CDR strain and its isogenic drug 
susceptible counterpart. In addition to the genes previ- 
ously shown to be associated with CDR, we could iden- 
tify about 50 genes which were overexpressed in the 
CDR strain. In particular, we show that the transcription 
factor encoding gene CZF1, which is involved in hyphal 
transition and white/opaque switching, is induced to- 
gether with CDR1 and CDR2 in Gu5. Czfl is likely to 
play an important role in CDR acquisition since its over- 
expression is a general feature of all the CDR strains that 
we have tested, but was not found in MDR strains. 
Moreover, its deletion caused susceptibility to several 
unrelated drugs. Additionally, the inactivation of CZF1 
increased the resistance of the cells to cell wall pertur- 
bating agents, through the overexpression of beta glucan 
synthesis genes. We propose that Czfl has a positive role 
on drug resistance and a negative role on cell wall integ- 
rity. Finally, we characterized a new transcribed region, 
previously undetected, just upstream of the TAC1 gene, 
which strongly suggests that TAC1 is subjected to com- 
plex post- transcriptional regulations, yet to be character- 
ized. Taken together, our results open several new ways 
to our understanding of drug resistance acquisition in 



Candida albicans and may provide new targets for anti- 
fungal therapies. 

Results 

Transcriptional landscape of Gu4 and Gu5 strains 

In order to conduct a comparative analysis of the tran- 
scriptomes of CDR versus drug susceptible C. albicans 
cells, we performed high-throughput sequencing of 
cDNA made from poly(A) RNAs obtained from the Gu4 
and Gu5 strains. Gu4 is a fluconazole susceptible clinical 
isolate obtained from an early infection episode. Gu5 is 
the corresponding fluconazole resistant clinical isolate 
obtained from later episode in the same patient treated 
with fluconazole. Gu4 and Gu5 are therefore supposed 
to be isogenic and mainly differ by their resistance to 
fluconazole. Gu5 has been characterised as a CDR strain, 
showing overexpression of CDR1 and CDR2, as com- 
pared with Gu4 [28]. For each strain, two biologically in- 
dependent samples of RNAs were analysed by RNA seq. 
We obtained 27 763 991 unique mappable reads for Gu4 
(96% of total reads) and 20 570 932 mappable reads for 
Gu5 (97% of total reads). As noticed in previous RNA 
seq studies, the background was very low, with no or 
few reads mapped in intergenic regions. The correlation 
coefficient between the biological replicates was very 
high (0.991 for Gu5 and 0.999 for Gu4). We could detect 
expression for 5727 (92%) of the 6177 ORFs annotated 
in the CGD. The 450 ORFs which were undetected in 
our experiments are mostly dubious ORFs or gene with- 
out clear function (71%). We arbitrarily grouped the 
expressed ORFs in five categories: very high expression 
(more than 1000 reads/nucleotides), high expression (be- 
tween 100 and 1000 reads/nucleotides), medium expres- 
sion (between 10 and 100 reads/nucleotides), low 
expression (below 10 reads/nucleotides) and very low 
expression (below 3 reads/nucleotides). These categories 
contained 3%, 20%, 57%, 12% and 8% of the genes, re- 
spectively. As described previously, the "very high ex- 
pression" category was enriched in genes encoding 
ribosomal proteins, glycolytic enzymes, translation regu- 
lators, histones and components of respiratory com- 
plexes. The "high expression" category was enriched in 
genes encoding enzymes involved in various metabo- 
lisms. The "medium" and "low" expression categories 
did not show any significant functional enrichment. 

We used our data to define the boundaries of each 
transcript, as described previously [25]. We compared 
the 5' and 3'UTR predictions that we obtained with the 
annotation set up by a previous RNA seq analysis of the 
Candida albicans reference strain SC5314 [25]. We 
found no significant difference in the two annotations 
for more than 75% of the genes, indicating that our data 
is reliable and that there is no global change in 5' and 
3'UTR length between those strains. Still, we had some 
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significant discrepancies with the annotation from Bruno 
et al for about thirty genes, which showed much longer 
5' or 3' UTR according to our data (Additional file 1: 
Table SI). This is for instance the case of TAC1, ADAEC 



and CZF1 (Figure 1). These genes did not show any par- 
ticular functional enrichment. Noteworthy, we did not 
detect significant differences in the size of transcripts be- 
tween the Gu4 and Gu5 strains, with the notable and 
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Figure 1 Expression profiles obtained for TACT, ADAEC and CZFL The density of reads along the genome has been visualized using 
Integrative Genome Viewer (IGV) and the CGD annotation. The signal is in reads/nucleotides. The coordinates and the transcript annotations 
previously available [25] are indicated below each IGV capture. TAG and ADAEC present extension in 5' in our data as compared with previous 
annotations. In the case of TAG, this extension was detected only in the Gu5 strain. 
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sole exception of TAC1 which presented an extension in 
5' which was apparently specific for the azole resistant 
strain (Figure 1). 

Finally, we used our data to detect new transcripts. To 
do so, the reads from the Gu4 and Gu5 strains were 
summed and used to identify regions of the genome 
with a minimum coverage of at least 3 reads/nucleotides 
(with at least one read in three independent samples) for 
a stretch of at least 50 nucleotides. With these criteria, 
we identified 1130 new transcription units, 800 of which 
partially or totally overlapped new transcribed regions 
recently identified by RNA seq [25,26] or tiling arrays 
[27] (Additional file 1: Table S2). We named these new 
transcripts NGuTs for "New Gu strains Transcripts". 
The median size of NGuTs was 425 nucleotides. The 
majority of these NGuTs did not contain any obvious 
ORF and are therefore likely to be non-coding RNAs. 



Transcripts differentially expressed in the drug resistant 
and the drug susceptible strains 

The identification of genes differentially expressed in 
Gu4 and Gu5 was performed combining two different 
statistical methods (see methods section) and using the 
gene annotations from the CGD, the new transcribed 
regions defined by previous studies [25-27] and the new 
transcribed regions defined in this study (see above). We 
found 130 genes significantly overexpressed in Gu5 and 
92 genes which were underexpressed in this strain com- 
pared to Gu4 (a full list and description of these genes 
can be found in Additional file 1: Table S3). 



Comparison with previous microarray experiments 

We first compared our results with the one obtained in 
a previous study, which used microarrays to analyse the 
transcriptomes of 4 different CDR isolates [18]. Among 
the 130 genes induced in Gu5, 74 (56.5%) were previ- 
ously shown to be overexpressed in one or several of 
these CDR clinical isolates (Additional file 1: Table S3). 
Among these 74 genes, 53 (72%) were found overex- 
pressed in at least 3 of the 4 CDR strains, suggesting 
that most of the genes identified in our RNA seq ana- 
lyses are general features of the CDR isolates. These 
genes include the CDR1 and CDR2 ABC transporter en- 
coding genes, together with the TAC1 gene encoding 
their transcriptional regulator. Remarkably, all the previ- 
ously identified targets of Taclp are found in our list, in- 
cluding PDR16, RTA3 and LCB4. 

Of the 92 genes underexpressed in Gu5, 42 (46%) were 
previously shown to be repressed in CDR isolates. This 
includes genes involved in iron and metal homeostasis 
(FTR1, SIT1, FET34 for instance) and amino acid metab- 
olism (MET3, OPT7, GAP4, SAM4 for instance). 



ORFs not previously associated with CDR 

We identified 53 ORFs which were overexpressed in 
Gu5 and were not previously associated with CDR 
phenotype. About half of them (45%) are uncharacter- 
ized and little information is available on their function. 

Among the characterized genes (Figure 2 and Add- 
itional file 1: Table S3), some of them were known as 
playing important roles in morphogenesis, hyphal 
growth and white/opaque switch. This is the case of the 
transcription factors Czfl and Sfl2, the kinase Sch9, the 
membrane sensor Gprl and its target Ecel, the signal 
transducer Srrl, the white specific gene of unknown 
function ADAEC and the cytoplasmic protein Whll. 
Some other induced genes are involved in cell wall 
maintenance and cell adhesion, like the mannosyl- 
transferase Mnn4 or the adhesion protein Alsl. Finally, 
two transporters encoding genes belong to this list: 
Yorl, which is homologous to an oligomycin resistance 
gene of S. cerevisiae and HgtlO, which encodes a gly- 
cerol permease involved in osmotic stress response. Re- 
markably, four of the CDR induced genes (CZF1, TAC1, 
ADAEC and or f 19. 67 13) present the particularity to have 
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Figure 2 New genes induced or repressed in the CDR isolate. 

This figure represents a subset of the genes (and associated 
functions) which were induced or repressed in the Gu5 strain 
compared with Gu4 and which were not previously associated to 
CDR acquisition. The complete list of genes up- or down-regulated 
in Gu5 is available in Additional file 1: Table S3. 



Dhamgaye et al. BMC Genomics 2012, 13:396 
http://www.biomedcentral.com/1471 -21 64/1 3/396 



Page 5 of 13 



a long transcribed region upstream of the start codon 
according to our RNA seq data, which could be inter- 
preted either as long 5' UTR or as new non coding tran- 
scripts (see Figure 1 and chapter below). 

Fifty genes were shown for the first time to be down- 
regulated in Gu5. Of these, 24 (49%) are uncharacter- 
ized. Among the verified ORFs (Figure 2 and Additional 
file 1: Table S3), we could find the gene encoding the 
ABC transporter Cdr4, which is homologous to Cdrl 
and Cdr2, and RTA4, which is homologous to the Tacl 
target, RTA3. We also found in this group additional 
genes involved in copper and iron homeostasis (HEM13, 
CRP1 and SMF3) and genes encoding extracellular pro- 
teins (PLB1, ALS2, MAL2). Two genes encoding poten- 
tial steroid targets, EBP1 and CBP1, were also 
underexpressed in Gu5, which is interesting considering 
the previously established link between steroid exposure 
and CDR response [29]. 

To support these results, we performed semi-quantitative 
RT-PCR analyses for 6 repressed and 8 induced genes, 
using actin as a reference and CDR1 as a positive control 
(Figure 3). These experiments confirmed the RNA seq 
results, except for EBP1 and orf 19.3769, for which no differ- 
ence of expression was detected by this method. 

New transcripts differentially expressed in Gu5 and Gu4 
strains 

Among the new transcripts identified in this study, 5 
were overexpressed and 8 underexpressed in Gu5 com- 
pared with Gu4. Four of the five induced transcripts 
were located immediately in 5' of an induced ORF. 
Transcript NGuT-chr4-123 (previously identified as 
NOVEL-Ca21chr4-073 in [25]) was in 5' of CZF1 
(Figure 1), NGuT-chr5-037 was in 5' of TAC1 (Figure 1), 
NGUT-chr2-038 was in 5' of ADAEC (Figure 1) and 



NGuT-chr3-128 (previously identified as TF3W223 in 
[27]), was close to orf 19.67 13. The fifth transcript, 
NGuT-chr3-105 (NOVEL-Ca21chr3-056), is located just 
upstream of the AAF1 gene, which encodes an adhesin 
and which is not induced in the Gu5 strain. 

All the new transcripts which were repressed in Gu5 are 
located at more than 1 kb from the adjacent ORFs. NGUT- 
chrR-157 (NOVEL-Ca21chrR-103) and 160 (TFRW269) 
belong to the same cluster of non coding RNAs. NGUT- 
chr4-069 (NOVEL-Ca21chr4-040) is located in a zone rich 
in tRNA genes. NGUT-chrR-041 (NOVEL- Ca21chrR-026) 
is close to a long terminal repeat. 

Semi-quantitative RT-PCR analyses for 4 repressed 
and 3 induced NGuTs, using actin as a reference con- 
firmed the RNA seq results, except for NGuT-chr3-105 
and NGUT-chrR-157 (Figure 3). 

New roles of the Czf 1 p transcription factor in multidrug 
resistance and cell wall maintenance 
CZF1 deletion alters drug resistance 

To go further in our investigation of the roles of new 
genes involved in the acquisition of CDR, we selected 
three of the induced genes, MNN4 (for mannosylpho- 
sphorylation), CZF1 (C. albicans Zinc Finger protein), 
and ALS1 (Agglutinin Like Sequence), and performed 
drug sensitivity profiling with the corresponding null 
mutants (Figure 4A). Serial dilutions of cells were grown 
on agar plates and minimum inhibitory concentration 
(MIC) was determined by broth microdilution assay with 
different concentrations of FLC, terbinafin (TER), aniso- 
mycin (ANS), amphotericin B (AMB), and ketoconazole 
(KTC) (Figure 4B). The homozygous mutants for MNN4 
and ALS1 did not exhibit any significant differences with 
their respective wild type strains. However, czfl null mu- 
tant displayed enhanced susceptibility to FLC, TER and 
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Figure 3 Confirmation of RNA seq results by semi-quantitative RT-PCR. Final point semi-quantitative RT-PCR was used to validate some of 
the genes found to be differentially expressed by RNA seq. CDR1 was used as a positive control of a gene induced in the Gu5 strain. ACT! was 
used as a reference gene equally expressed in Gu5 and Gu4. The genes for which the PCR results did not confirm the RNA seq results are marked 
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Figure 4 Phenotypic profiling of some strains mutated for 
genes induced in Gu5. (A). Serial dilution assays showing drugs 
susceptibilities of strains deleted for genes found to be induced in 
Gu5 and of their isogenic wild type parents. (B). MIC80 calculated for 
each drug and each strain. FLC, terbinamycin (TER), anisomycin 
(ANS), amphotericin B (AMB), and ketoconazole (KTC). 



ANS in comparison to its wild type strain (Figure 4A). 
MIC 80 of the czfl null strain was calculated as 0.12, 1 
and 3.12 (ig/ml for FLC, TER, and ANS, respectively, 
which was ~ 3 fold less than its isogenic wild type 
(Figure 4B). Notably, the strain heterozygous for the de- 
letion of CZF1 had wild type levels of drug resistance 
(data not shown), czfl nulls did not show any differential 
susceptibility to AMB or KTC (Figure 4). 



CZF1 deletion does not impair the expression of standard 
MDR and CDR actors 

To understand the molecular basis of the drug sensitivity 
described above, we measured, in the wild type and in 
strains mutated for one or both copies of CZF1, the ex- 
pression levels of major actors of multidrug resistance 
pathways (CDR1, MDR1, UPC2, PDR16, HSP104) or of 
genes changing expression in Gu5 in our experiments 
(CDR4, YOR1, WH11). Final point semi-quantitative RT- 
PCR did not show any difference in the level of these 
genes in the wild type and in strains mutated for one or 
both copies of CZF1, indicating that Czfl does not act 
on the steady state expression of MDR and CDR actors 
in a laboratory strain (Figure 5A). 



CZF1 negatively controls beta-glucan synthesis and 
resistance to cell wall perturbation agents 

The sensitivity of the czfl null mutant to various drugs 
prompted us to test the phenotype of this mutant in pres- 
ence of drugs targeting the calcineurin pathway or cell 
wall integrity. Calcineurin has been shown to play mul- 
tiple roles in stress responses including resistance to FLC 
and to membrane perturbation [30,31]. It has also been 
shown to influence the dimorphic switch, a process which 
is controlled by Czfl [31]. We tested growth of wild type, 
czflA/czflA or czflA/CZFl strains in presence of the 
membrane perturbation agent SDS and the specific calci- 
neurin inhibitor FK506. We observed that the double mu- 
tant was more sensitive to these drugs than the single 
mutant and the wild type strain (Figure 5C). We tested 
the expression of several direct or indirect actors of the 
calcineurin pathway (CMP1, CNB1, CRZ1, HSP90 and 
HOG1). We did not detect any effect of the partial or total 
inactivation of CZF1 on the expression of these genes 
(Figure 5B). Reciprocally, we tested the expression of 
CZF1 in various strains mutated for one or several actors 
of the calcineurin pathway. Again, we did not see any 
effects of the inactivation of the calcineurin pathway on 
the steady-state expression of CZF1 (data not shown). 

In C. albicans, the cell wall plays important roles in viru- 
lence, defense against host and resistance to some drugs 
[32]. Surprisingly, we observed that the czflA/czflA strain 
was more resistant than the wild type to congo red (a cell 
wall perturbation agent) (Figure 5C). The heterozygous mu- 
tant exhibited an intermediate phenotype (haploinsuffi- 
ciency), suggesting a dosage effect for the role of CZF1 in 
cell wall maintenance. Congo red is considered to act on 
the cell wall glucan structure. Therefore, we measured the 
expression of the three beta 1,3-glucan synthase encoding 
genes (GSC1, GSL1 and GSL2) in presence and absence of 
CZF1 (Figure 5B). We observed no or few differences in 
GSC1 and GSL2 expression. In contrast, we observed a 
clear overexpression of GSL1 in the strains deleted for one 
or two copies of CZF1, compared to the wild type. This re- 
sult suggests that Czfl negatively controls the expression of 
this gene and that the resistance to congo red that we 
observed in the czfl mutants could be linked to the partial 
deregulation of beta-glucan synthesis. Interestingly, a defect 
in beta-glucan activity has been shown to cause sensitivity 
to many different drugs, including azoles [33]. However, the 
expression of GSL1 } GSC1 and GSL2, tested by RNA-seq 
and semi-quantitative RT-PCR, was unchanged in the Gu5 
strain as compared with Gu4 (data not shown). 

CZF1 overexpression is commonly associated with CDR 
acquisition 

To check whether an upregulation of CZF1 is a general 
phenomenon among drug resistant clinical isolates, we 
examined the expression of CZF1 in six azole sensitive 
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Figure 5 Effects of CZF1 deletion on genes involved in CDR 7 calcineurin or beta-glucan biosynthesis pathways. The expression of several 
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(AS) and azole resistant (AR) matched pairs (Figure 6). 
The first group of AS/AR isolates includes strains 
(DSY347/DSY289; DSY544/DSY775 and Gu4/Gu5) 
in which the AR isolate is characterized by the 



overexpression of CDR1 and CDR2 (CDR group). The sec- 
ond group included strains (F2/F5; G2/G5 and DSY290/ 
DSY292) where AR isolates showed an overexpression of 
MDR1 (MDR group). Our quantitative RT-PCR data 
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DSY544 DSY775 DSY347 DSY289 Gu4 Gu5 ™ 1 ™ s TW9 ™ 17 



CZF1 
ACT1 




MIC J0 

(Hg/ml): 0.8 10 10 100 



MDR pairs (sensitive/resistant) 
F2 F5 G2 G5 DSY290DSY292 



CZF1 



ACT1 




Figure 6 CZF1 induction is commonly associated with CDR acquisition. The expression of CZF1 was measured by final point semi- 
quantitative RT-PCR in different pairs of drug sensitive/resistant strains presenting either a drug resistance of the CDR or of the MDR kind. On the 
right panel, the expression of CZF1 was measured in a series of isolates coming from the same patient at different step of fluconazole resistance 
acquisition [34]. Seventeen isolates were available from the original publication, four of them (numbers 1, 8, 9 and 17) were tested here. The 
MIC50 of each isolate for fluconazole is indicated. ACT! was used as a reference gene. 
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revealed that CZF1 was upregulated in two AR isolates of 
the CDR group and one of the MDR group (DSY290/ 
DSY292; DSY347/DSY289; Gu4/Gu5) (Figure 6, upper 
left). In contrast, the levels of CZF1 remained unchanged 
in the other pairs of isolates (Figure 6, bottom left). We 
next analyzed the expression of CZF1 in a series of clinical 
isolates acquired from a single HIV-infected patient over 
the period of two years where the levels of FLC resistance 
of the strain increased over 200-fold [34] (Figure 6, right 
panel). We observed a gradual overexpression of CZF1, 
which followed the increase in FLC resistance of those iso- 
lates (Figure 6, right panel). 

Taken together these results strongly suggest that the 
overexpression of CZF1 is generally correlated to the ac- 
quisition of the CDR phenotype, with the notable excep- 
tion of the DSY544/DSY775 pair. 

Discussion 

In this work, we have used RNA-seq to analyze the tran- 
scriptome changes associated with CDR acquisition in a 
pair of clinical isolates in the human pathogen C. albicans. 
Our aim was to obtain an accuracy, sensitivity and com- 
pletion unreached by previous microarray analyses of the 
same strains. Based on previous RNA-seq studies, we 
expected a better detection of mRNAs and of their expres- 
sion changes, the identification of unannotated transcripts 
possibly up or down regulated in the CDR strain, and the 
detection of changes in the 5' and 3' UTR lengths com- 
pared with available annotations. Indeed, our data pointed 
out about one hundred genes, which were either induced 
or repressed in the Gu5 strain compared with Gu4, and 
which were not associated to CDR acquisition by previous 
studies. Although we identified more than one thousand 
new transcripts (some previously annotated by RNA seq 
or tiling arrays, some completely new), very few of them 
changed expression in the CDR strain. Four of these new 
transcripts were found to be induced in Gu5, but their 
position just upstream of annotated genes, which are simi- 
larly induced in Gu5, raises doubts on their actual exist- 
ence as independent transcripts, or as long, previously 
undetected, 5' UTR of the downstream gene. 

The analyses of this large amount of data led to two 
important conclusions, which open new perspectives for 
our understanding of multidrug resistance in this patho- 
genic yeast. First, we revised the annotation of the TAC1 
genomic region, identifying a new transcribed region in 
5' of the TAC1 mRNA. Second, we suggested new roles 
for the Czfl transcription factor in CDR acquisition and 
cell wall maintenance. 

A new transcribed region upstream of TAC1 and its 
potential impact on TAC1 expression regulation 

RNA seq and tiling arrays experiments have allowed the 
systematic annotation of transcript boundaries in three 



yeast species: S. cerevisiae, C. albicans and C. parapsilo- 
sis [21,25,27,35]. In all three species, the 5' and 3'UTR 
have been shown to be generally short, with an average 
size ranging between 50 and 80 nucleotides. However, in 
all these species, many exceptions have been described. 
In C. albicans, about 100 genes have 5' UTR longer than 
500 bp [27]. Interestingly, this group is enriched in genes 
encoding transcriptional regulators and genes involved 
in filamentation. These include the CZF1 transcription 
factor encoding gene, which was induced in our experi- 
ments and which has a 5'UTR of about 2 kb ([27,36]; 
this study). This observation is also true in mammals in 
which many genes with long 5'UTR encode regulators 
and proto-oncogenes [37]. Long 5'UTR usually have im- 
portant roles in the post-transcriptional and translational 
regulation of gene expression. In the yeast Saccharo- 
myces cerevisiae, the expression of several transcription 
factors, including Gcn4 (amino acid starvation), Yapl 
(oxidative stress response) and Yap2 (cadmium re- 
sponse), is regulated by the presence of short upstream 
ORFs (uORFs) in their long 5'UTR region, which exert 
negative or positive actions on the translation of the 
main ORF or on mRNA stability, depending on the cel- 
lular context [38]. Recent studies have identified about 
250 genes containing potential uORFs in this species 
[21]. 

As mentioned in the introduction, TAC1 encodes the 
major transcriptional regulator of CDR genes. Its own 
mRNA expression is co-induced with those of its main 
targets, CDR1 and CDR2, possibly due to an autoregula- 
tory loop [18,29]. In previous transcript annotations, 
TAC1 mRNA was described as having a 5'UTR of about 
300 nt [25,27]. Our RNA seq data detected a transcribed 
region up to 1 kb upstream of the TAC1 start codon, 
which was apparently specific of the Gu5 strain 
(Figure 1). This could be interpreted as a new transcript 
expressed only in the CDR strain, as an alternative tran- 
scription start site (TSS) specific for Gu5 (alternative 
TSS had been described in C. albicans, for instance in 
the case of the EFG1 transcription factor encoding gene 
[39]) or as the normal TSS of TAC1 which could not be 
detected in drug sensitive strains because expression of 
TAC1 in those strains and RNA-seq coverage in previous 
experiments and in this study were too low. 5'RACE and 
northern blot experiments could resolve if it is an alter- 
native TSS or a new transcript. However, the strong cor- 
relation between the expression of the two regions 
(Figures 1 and 3) suggests that TAC1 mRNA presents, at 
least in the Gu5 strain, an unusually long 5'UTR, which 
was previously not detected. Considering the important 
roles of long 5'UTR in translation regulation, this opens 
exciting new perspectives to the post-transcriptional 
regulation of TAC1 expression, which was poorly investi- 
gated until now. Especially, we observed in this 5'UTR 
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the presence of one large (about 50 codons) uORF in 
the same orientation and phase than the TAC1 ORF 
(Figure 7), which suggests the existence of regulatory 
mechanisms similar to the ones described for GCN4, 
YAP1 or YAP2 in S. cerevisiae. The analysis of ribosome 
occupancy at TAC1 mRNA in CDR matched pairs, ei- 
ther measured by global ribosome profiling or by poly- 
somes gradients, will give information on this potential 
regulation. Moreover, a long 5'UTR would have import- 
ant implications on the way we consider the transcrip- 
tional regulation of TAC1. Indeed, yeast enhancers are 
usually located close upstream to the TSS. Since 5'UTR 
regions are also limited in size, people tended to look for 
DNA consensus motifs within 800 bp upstream of the 
start codon of the genes. The fact that the actual TSS of 
TAC1 would be located about 0.8 kb upstream of all pre- 
vious estimations also change the potential position of 
its promoter region. For instance, it was shown that 
Tacl binds its own promoter, which led to the hypoth- 
esis that it is regulated through a positive auto- 
regulatory loop [18]. However, this hypothesis was chal- 
lenged by the fact that the two potential Tacl binding 
motifs (DRE) in this region were located more than 1 kb 
upstream of TAC1 ATG. In the annotation proposed in 
Figure 7, these DRE would be located very close to the 
TSS, which strongly supports their functional role in the 
activation of TAC1 expression. 

New roles for Czfl in CDR acquisition and cell wall 
maintenance 

Czfl is a zinc finger transcription factor mostly known 
for its role in the epigenetic white /opaque switch, which 
controls cell and colony morphogenesis and properties 
[40]. Czfl acts synergistically to Worl and Wor2 tran- 
scription factors, and antagonistically to Efgl, to pro- 
mote the opaque state. Czfl also has a role in hyphal 
growth, (also called filamentation). Czfl mutants filament 
normally under standard filamentation induction condi- 
tions, but they are defective in hyphal development in an- 
aerobic conditions [41]. This phenotype is dependent on 
the composition of the media, for instance in terms of car- 
bon source. The positive role of Czfl on hypoxic filamen- 
tation is under the negative control of the TOR pathway 



and more specifically, of the Sch9 kinase [42]. As for the 
white/opaque switch, Efgl plays an important and com- 
plex role in filamentation. In aerobic conditions, it is a 
positive regulator of hyphal development, whereas in hyp- 
oxia, it counteracts the positive action of Czfl on this 
process [43]. Direct cross-talks exist between Efgl and 
Czfl: Efgl binds to the promoter of CZF1 and repress its 
expression, while Czfl has a similar effect on the EFG1 
promoter [36,44]. More recently, Czfl has been shown to 
be required for wild type adherence to silicone devices 
[45]. Despite these roles in morphogenesis, the list of Czfl 
targets has not been established to date. A partial tran- 
scriptome analysis of the czfl mutant, conducted on 250 
genes using the Nanochip technology, detected very few 
expression changes in this mutant [45] . 

Our data strongly suggest that Czfl plays a role in the 
acquisition of multidrug resistance of the CDR type. 
First, CZF1 is overexpressed in all CDR mutants that we 
tested. Noteworthy, some other actors of white/opaque 
switch and hyphal growth were also overexpressed in 
the Gu5 strain, including SCH9, ADAEC, WH11 or 
SFL2. Second, the czfl null mutant is sensitive to many 
unrelated drugs, including azoles and molecules target- 
ing the calcineurin pathway. Remarkably, CZF1 does not 
seem to be involved in drug resistance phenotypes of the 
MDR kind, since its expression was unchanged in MDR 
mutant strains. The drug sensitivity associated with the 
inactivation of Czfl does not seem to be due to standard 
drug resistance pathways, since the level of expression of 
the main actors of the CDR, MDR and calcineurin path- 
ways were unchanged in this mutant. However, the fact 
that Czfl does not control the steady-state expression 
levels of those genes does not mean that it is not 
involved in their induction following drug exposure. 
This hypothesis will be tested by inactivating CZF1 in 
the Gu5 and Gu4 backgrounds and testing the effect of 
this inactivation on drug resistance and gene expression. 

Surprisingly, we observed that the czfl mutant was re- 
sistant to congo red, a cell wall perturbation agent which 
targets beta-glucans, the most abundant component of 
Candida albicans cell wall. This phenotype was haploin- 
sufficient, since the heterozygous mutant exhibited an 
intermediate resistance between the null mutant and the 
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wild type. Congo red is thought to target the glucan 
architecture by directly interacting with glucan chains, 
therefore inhibiting synthesis of glucan polymers [46]. 
We observed that the GSLl gene encoding one of the 
three beta 1,3 glucan synthase subunits, was strongly 
overexpressed in the czfl mutant. This suggests that 
Czfl has a negative effect on the expression of some of 
the actors of the glucan synthesis pathway, which could 
explain the congo red resistance observed in the null 
mutant. Many functional connections have been estab- 
lished between cell wall components and hyphal growth. 
GSLl itself has been shown to be down- regulated during 
filamentation [47], a process that is positively regulated 
by Czfl. Notably, Efgl is also involved in cell wall archi- 
tecture and efgl mutants show a haploinsufficiency 
phenotype [47,48]. The deletion of one or two copies of 
EFGl leads to a decrease in the expression of many 
enzymes involved in cell wall biogenesis, including 
beta 1-3 glucan synthases. This causes important changes 
in cell wall thickness and composition [48]. Again, Czfl 
and Efgl seem to antagonize each other in the regula- 
tion of cell wall maintenance. The over-expression of 
some cell wall protein encoding genes has been exten- 
sively described as a compensatory mechanism in re- 
sponse to impairments in cell wall biosynthesis or to cell 
wall damages, both in S. cerevisiae and C. albicans (see 
for instance [49-51]). This phenomenon is largely con- 
trolled by the cell wall integrity (CWT) pathway [52] but 
part of it is independent of CWI [53]. For instance, treat- 
ment with caspofungin (which directly inhibits the activ- 
ity of the essential beta-glucanase subunit encoded by 
GSC1) triggers a compensatory process which leads to 
the overexpression of GSLl, similarly to what was 
observed in the czfl null mutant [51]. We suggest that 
Czfl could play a role in this process. 

Conclusions 

In conclusion, our results led us to propose two new 
roles for Czfl in drug resistance and cell wall mainten- 
ance (Figure 8). The connection between these two roles 
is unclear. Several cell wall proteins are induced in CDR 
strains (CRH11, ALS1, SCW11, PGA13, AGP2, IFF4, 
RBT5, EXG2, ECM331, CSP37, etc.) ([18]; this study). 
However, some CDR strains have cell wall defects, while 
others (including Gu5) seem to have normal cell wall 
architectures (Singh and Prasad, submitted). As men- 
tioned above, Gscl (previously known as Fksl) is the 
main target of antifungals of the echinocandin family 
(which includes caspofungin) and mutations in GSC1 
confer resistance to these drugs [54,55]. However, nei- 
ther GSLl nor GSC1 show any significant changes in the 
CDR strains and the GSLl overexpression in the czfl 
null mutant does not explain the large spectrum of drug 
sensitivity observed for this strain. The systematic 




Beta-glucane synthesis and cell wall 
architecture 



Figure 8 New roles for Czfl. Czfl had been shown to antagonize 
the role of Efgl in hypoxic filamentation and white/opaque 
switching. In this study, we suggest new roles for Czfl in MDR and 
in cell wall architecture. In this last role, Czfl would antagonize the 
positive role of Efgl on the expression of genes involved in cell wall 
biosynthesis and maintenance, as for instance in the case of GSLl. 



determination of the Czfl targets by transcriptomic ana- 
lyses of the czfl mutant and chromatine immunoprecipi- 
tation will certainly thrfow some light on its role in 
these two processes. 

Methods 

Material 

All the media components were obtained from HiMedia 
(Mumbai, India). Amphotericin B (AMB), terbinafin, 
anisomycin, ketoconazole, sodium dodecyl sulphate, 
FK520, congo red were purchased from Sigma chemicals 
Co. (St. Louis, MO). Ranbaxy India generously provided 
FLC. Reverse transcriptase kit was purchased from fer- 
mentas, India. Caspofungin was generously provided by 
Dr. Neeraj Chauhan (Public Health Research Institute 
and Department of Microbiology and Molecular Genet- 
ics, New Jersey, USA). dATP, dGTP, dTTP and dCTP 
were bought from Sigma chemicals Co. (St. Louis, MO). 

Yeast strains and growth conditions 

Gu4 and Gu5 strains were described elsewhere [28]. The 
strains of Candida albicans mutated in various genes 
(ALS1, MNN4, CZF1) that were used for phenotypic 
tests, were obtained from Dr. Ibrahim, A.S. (David 
Geffen School of Medicine at UCLA Los Angeles Bio- 
medical Research Institute), Dr. Gow, N. (The university 
of Aberdeen, Scotland) and Dr. Kumamoto, C.A. (Tufts 
University, Boston, USA) respectively. The complete list 
of strains can be found in Additional file 1: Table S4. 
Cells were grown in YPD rich medium (2% glucose, 1% 
bactopeptone, 1% yeast extract) at 30°C for 48 hrs. 
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RNA extraction 

Twenty ml of exponentially growing (OD = 0.6-0.7) Gu4 
or Gu5 cells were flash frozen in 30 ml of cold ethanol, 
centrifuged for 5 minutes at 3000 g, washed with sterile 
water and centrifuged again. The cell pellets were stored 
at -80°C. RNA were extracted using the RNeasy kit 
(Qiagen), with DNAse treatment. The RNA quality was 
checked on a bioanalyzer Nanochip (Agilent). 

RNA sequencing 

cDNA libraries and sequencing 

Strand non-specific cDNA libraries (mRNA-Seq 8-Sample 
Prep Kit) were prepared according to manufacturers 
instructions (Illumina). The validation of libraries was 
made with Bioanalyser DNA 1000 chip (Agilent). All li- 
braries were sequenced in single read mode, using a Gen- 
ome Analyser instrument (Illumina). 

Reads mapping and counting 

Before read mapping, we trimmed polyN read tails, 
removed reads with a length lower or equals to 11 bases 
and discarded reads with quality mean lower or equals 
to 12. We aligned from the 4 samples a total of 
48334923 bp single end reads produced from the Illu- 
mina GAII device using the SOAP2 software (version 
2.20) [56] on the Candida albicans genome (assembly 
21). The mapping tool was configured to return all 
alignment hits by setting the seed length to 28 bp. In 
addition we authorized with SOAP2 up to 5 mismatches 
with 2 in the seed. 

Before expression estimation, alignments from reads 
that matches more than one time on the reference gen- 
ome were removed. In addition, we searched for new 
transcripts by combining the Gu4 and Gu5 reads and 
identifying regions of the genome with a minimum 
coverage of at least 3 reads/nucleotides (with at least 
one read in three independent samples) for a stretch of 
at least 100 nucleotides. To compute gene expression 
abundance, we used Candida albicans GFF genome an- 
notation from assembly 21 together with the annotation 
file for new transcripts and we counted all overlapping 
regions between alignments and listed exons. The 
complete results have been deposited at the GEO data- 
base (accession number: GSE38298). 

Data mining 

Functional analyses of the genome-wide data were con- 
ducted using the CGD GO (gene ontology) term finder 
[57], with default parameters. 

Differential analyses 

Genes differentially expressed between Gu5 and Gu4 
strains were identified applying two different methodolo- 
gies available in the R programming language, i.e. DESeq 



[58] and edgeR [59,60]. Default parameters were used 
and genes with p- value lower than 0.01 with the two 
methods were finally selected for further analysis (Add- 
itional file 1: Table S3). 

Final point semi-quantitative RT-PCR analyses 

RT-PCR was done using the RevertAidTM H Minus kit 
(MBI, Fermentas). Briefly, 1 \ig isolated RNA was primed 
with oligo (dT)18 for cDNA synthesis at 42°C for 
60 min. Reverse transcription reaction was terminated 
by heating at 70°C for 5 min. The synthesized cDNA 
product (2 \A) was directly used for PCR amplification 
reaction (50 using gene specific forward and reverse 
primers. The amplified products were gel electrophor- 
esed and quantitated by using Quantity One software by 
Bio Rad gel documentation system. The RNA samples 
used for these validation experiments were independent 
from the samples used for RNA seq. The sequences of 
the primers used for PCR reactions can be found in 
Additional file 1: Table S5. 

Phenotypic tests 

Drug-susceptibility was tested by broth-microdilution 
assay according to CLSI (Clinical and Laboratory Stan- 
dards Institute) and serial dilution assay essentially as 
described previously (CLSI, 2008). Following drugs 
stocks (in parenthesis) and working concentration were 
used 2.5 ug/ml of FLC (5 mg/ml), 4 jig/ml of TER 
(4 mg/ml), 4 (ig/ml of AMB (4 mg/ml), 150 (ig/ml of 
ANS (20 mg/ml), 0.5 ug/ml of KTC (1 mg/ml), 50 ng/ml 
of CAS (1 mg/ml), 0.08% SDS, 200 ng/ml of CR (10 mg/ 
ml), and 200 ng/ml FK520 (20 mg/ml). 

Additional file 



Additional file 1: Table SI. Genes with enlarged 3' or 5' UTR 
boundaries as compared with the annotation of Bruno et al. 2010. Table 
S2. annotation and coordinates of new transcripts found based on our 
RNA seq data. Overlap with previously annotated transcripts are indicated 
(annotations taken from Candida Genome Database, Bruno et al., 201 0; 
Sellam et al., 2010 and Tuch et al., 2010). Table S3. CDR Induced genes. 
Genes that were found significantly overexpressed in Gu5 by Deseq and 
EdgeR. Log2(ratios) are indicated, together with CGD descriptions. The 
CDR isolates column indicate the number of different isolates in which 
the gene was previously found to be overexpressed, based on the 
studies of Liu et al., 2007 and Znaidi et al., 2006. Table S4. List of the 
strains used in the study. Table S5. Sequences of the primers used in the 
study. 
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